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HUMAN INTER-ALPHA-TRYPSlN INHIBITOR GENE 



Background of the Invention 

Inter-alpha-trypsin inhibitor (TT1) is a serine 
protease inhibitor found in human plasma and se- 
rum. This protein has been reported to inhibit the 
proteolytic enzymes trypsin, chymotrypsin and 
neutrophil elastase. However, the specific target of 
the inhibitor is unknown, m is unique among plas- 
ma proteinase inhibitors in that It occurs as more 
than one molecular weight species. It is found as a 
high molecular weight form of 180 kilodattons (kd) 
and as a lower molecular wefght species of 30 kd. 
This lower molecular weight species has been des- 
ignated HI-30. HI-30 can be released from the high 
molecular weight form by treatment with trypsin. 
The HI-30 fragment contains all of the anti-pnot- 
eoiytic activity of the high molecular weight form. 
However, it has not been demonstrated directly 
that the 180 kd protein Is the precursor of the Hi-30 
protein found in plasma, ft Is possible that HI-30 is 
produced independently of the 180 kd form. 

The total structure of high molecular weight IT) 
is unknown. However, the degradation product, HI* 
30, has been purified from human urine and the 
complete amino acid sequence determined. HI-30 
Is composed of 50% carbohydrate and contains 
two tandemly linked antj-proteolytic domains. Both 
domains are structurally related to the Kunitz family 
of low molecular weight serine protease inhibitors. 

Although 180 kd ITJ has been described as a 
single chain glycoprotein, recent experiments in- 
dicate there are multiple messenger RNAs 
(mRNAs) in baboon liver coding for polypeptides 
which compose m. One of these mRNAs codes for 
a 42 kd protein, termed the m light chain. This 
protein reacts with antibodies specific for HI-30. 
See Bourguinon et a!, 1983. A complementary DMA 
(cONA) done of the m light chain has been Iso- 
lated by others from a human liver mRNA library 
and partially sequenced to the Inhibitory portion of 
the second domain. This work, In contrast to amino 
acid sequence data, showed that HI-30 is at the 
carboxy-tenminus of the IT1 light chain. See Bour- 
guinon et al, 1985. 

Using gene cloning techniques, a cDNA copy 
of the gene for the ITI tight chain which contains 
the HI-30 protein coding region was Isolated from a 
human liver mRNA library. The entire nucleotide 
sequence of the longest clone (identified by colony 
hybridization) was determined. This confirmed that 
the gene for HI-30 is at the 3* end of the gene, 
which corresponds to the carboxy terminus of the 
protein. The amino acid structure (which can be 
derived from the nucleotide sequence by transla- 



tion of the genetic, code) was generally In accord . 
with data available from the previously reported HI- 
30 protein sequence. However, it has now been 
discovered that 5' to this structural gene for HI-30 
6 is an open reading frame which encodes the serum 
protein. alpha-1 -microglobulin. Alpha-1 -micro- 
globulin Is also known as protein HC. Alpha-1- 
mlcroglobulln Is a 31 kd glycoprotein found in 
plasma, urine and cerebrospinal fluid. This protein 
to is similar in structure to a protein isolated by other 
workers who designated it protein HC. For all in- 
tents and purposes, It can be considered that 
alpha-1 -microglobulin and protein HC are derived 
from the same protein and the structural differ- 
15 ences repiesent artifacts arising from the methods 
of amino acid sequence determination. Protein HC 
contains a yellow-brown chromophobe group and 
migrates as a heterogeneously charged protein 
upon agarose gel electrophoresis, even after re- 
do moval of the sialic acid carbohydrate residues with 
the enzyme neuraminidase. Alpha-1 -microglobulin 
is found in plasma both in the free form and as a 
complex with IgA. It has recently been suggested 
that alpha-1 -microglobulin Is Important in modulat- 
es Ing neutrophil chemotaxis. The association of HI-30 
with eipha-1-microgIobufln (protein HC) has not 
been previously reported or suggested in the litera- 
ture. 

30 

Summary of the Invention 

The invention disclosed herein is directed to a 
cONA sequence encoding alpha-1 -microglobulin in 
as sequence with the HI-30 coding region of the m 
fight chain. Also disclosed is a fusion protein of 
aipha-1 -microglobulin and HI-30 produced from the 
cloned DNA sequences. 

40 

Brief Description of the Drawings 

Figure 1 depicts the oligonucleotide probes 
used for isolation of the ITI light chain cONA clone. 
45 The protein sequence used for designing the 
probes Is taken from Wachter and Hachstrasser, 
1981. The sequence from domain I is MeM9 to 
Phe-68, and that from domain I) is Cys-1 15 to Cys- 
132. 

so Figure 2 depicts the nucleotide and protein 

sequence of the ITI light chain. The sequence of o- 
1 -microglobulin (o-1-mlc) begins at nucleotide 108 
and extends to nucleotide 656. Hi-30 begins at 
nucleotide 886 and extends to nucleotide 1100. 
Domain I begins at nucleotide 729; domain li starts 
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at nucleotide 897. Amino acid sequences which 
were used for designing oligonucleotide probes are 
underlined. Positions which differ from the protein 
sequence of HK30 purified from urine are enclosed 
In boxes. 

Figure 3 represents the protein structure of 
the ITI light chain. Met is the first amino acid of the 
open reading frame. "S M indicates the signal se- 
quence end AJa-Gly represents the potential signal 
peptidase cleavage site. The Arg-Arg dipeptide Is 
found at the boundary of «-1 -microglobulin and Hh 
30 regions. is the 21 amino acid sequence 
preceding the' Kunltz-like domains I and li. The Ser* 
Asn dipeptide occurs at the end of the open read- 
ing frame. 

Figure 4 depicts the Immunopreclpftation of 
the translation product of light chain mRNA. RNA 
was transcribed from the insert cloned into the 
Bluescribe© expression vector. The RNA was 
capped in vitro and translated with a mlcrococceJ 
nuclease-treated rabbit reticulocyte lysate incorpo- 
rating 35-8 methionine. Lane A is a translation with 
no RNA added and lane G is a translation of light 
chain mRNA. Lanes B, C, D are light chain RNA 
translations precipitated with non-Immune antibody, 
anti-l-galactosidase, and no antibody. Lanes E and 
F were precipitated with the anti-a-1 -microglobulin 
and the arrtMnter-a-trypsIn Inhibitor, respectively. 



Detailed Description of the invention 

The approach to the Isolation of this new gene, 
which was found to encode for both alpha-1 -micro- 
globulin and the HI-30 portion of m, was to screen 
a gene library to identify putative clones containing 
the HI-30 gene by use of specific long oligodeoxy 
ribonucleotide probes. These probes were de- 
signed by selecting amino add regions of HI-30 
with low codon degeneracy and converting (reverse 
translating) this amino, acid sequence into a unique 
ONA sequence using most preferred human 
codons, that is, codons which are used most fre- 
quently in genes cooing for human proteins. The 
protein sequences utilised In this design and the 
synthetic oligomers prepared are given in Figure 1. 
One probe represents the amino acid sequence 
from Met-48 to Phe-68 which Is located In the first 
anti-proteolytlc domain of HI-30. Another probe was 
designed based on amino acid Cys-115 to Cys- 
132, found In the second domain. Each probe was 
constructed from two 33 nucleotide long oligomers 
which overlapped by 12 complementary 
oligonucleotides at the central region. 
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Synthesis of Oligonucleotide Probes 

The oligonucleotide probes were prepared us- 
ing solid-phase synthetic methods (Atkinson et el, 
1984). The scheme for synthesis of the oligomers 
was as outlined by Matteucci et af (1081) utilizing 
proton activated, protected 2'*deoxy-ribonucieotide 
phosphoramidites (Beaucage et ai, 1981). All se- 
quential steps were performed in an automated 
manner on an Applied Blosy stems Model 380 DNA 
Synthesizer using protected nucleotides, solvents, 
chemicals and reagents, all of which were obtained 
from AppHed Blosystems, Foster City, California 
U.S.A. The solid-phase support (also from AppBed 
Blosystems) was controlled pore glass to which the 
starting ^-nucleotide was already attached. Certain 
modifications were introduced Into the automated 
reaction cycle in accordance with the manufac- 
turer's recommendations. Upon completion of the 
synthesis, the oligomers were deblocked and 
cleaved from the solid support within the ONA 
synthesizer according to the manufacturer's rec- 
ommendations. 

Removal of the blocking groups was completed 
by heating the aqueous solution containing the 
oligomer with concentrated ammonium hydroxide 
at 55° centigrade (C) for from 4 to 24 hours in a 
sealed vial. The resulting solution was evaporated, 
the residue dissolved in 0.01 (molar) M triethylanv 
monlum bicarbonate buffer, pH'7.0 (TEAS buffer). 
This solution was chromatographed over 
Sephadex-G50® Gel Filtration Resin. This column 
was prepared in, and eluted with, the same TEAB 
buffer. Material editing with the void- volume was 
pooled and the solution evaporated. A portion of 
the residue (10 to 40% of the absorbance units at 
260 nanometers), dissolved in loading buffer 
(composition: 0.1% Bromophenol Blue, 0.1% Xy- 
lene Cyanoi, 10 milllmolar dfsodium EDTA, in for- 
mamlde) was further purified by electrophoresis on 
polyacrylamlde gels. The gel size was 18 x 32 
centimeters (cm) with a thickness of 141 mflBmeters 
(mm). The well size for each oligomer purified in 
this manner was 2 to 5 cm in width and up to five 
oligomers were purified using a single get The 
concentration of acryiamJde In the gel varied from 
14 to 20%, depending on the chain length of the 
desired product. For longer oligomers, the 14% gel 
is preferred, while shorter oligomers were purified 
on up to a 20% acrylamlde gel. The gel also 
contained 7M urea and Trts-borate-EDTA buffer 
(0.1 M Trls, 0.1 M borate, 2 milllmolar EDTA, pH 
8.3). The running buffer was the same Trls-Borate- 
EDTA mixture. Electrophoresis was carried out at 
20 to 80 watts, constant power, for from 6 to 18 
hours. 
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Following completion of the electrophoresis, 
the gel was encased In plastic wrap and the 
oligomers were visualized by shadowing with ul- 
traviolet light This shadowing was accomplished 
by placing the wrapped gel on a fluorescent thin 
layer chromatography plate and viewing the gel 
with a short wave length ultraviolet light source. 
The desired product appeared as the slowest mi- 
grating, major blue band by this shadowing tech- 
nique. The desired band was excised from the gel. 
The DNA oligomer was eluted from the gei slice 
onto powdered diethylaminoethyl (DEAE) cellulose 
using an EpiGene (Baltimore. Maryland, U.S.A.) D- 
Geio electrophoresis apparatus. The oligomer was 
recovered from the cellulose by elution with 1 M 
TEAB buffer. The buffer solution containing the 
oligomer was evaporated, the residue dissolved In 
0.01 M TEAB buffer* and then desalted by passage 
over a column of Sephadex-650® as described 
previously. The material elutlng in the void volume 
was pooled and. lyophlilzed to give the final prod* 
uct Using these procedures, about 0.5 to 5.0 A260 
units of each of the purified oligomers was ob- 
tained. 

The purified oligonucleotide probes were 
radiolabeled by annealing the individual oligomers 
and fflllng-ln the remaining single stranded regions 
with Klenow fragment and radioactive nucleotides 
by the following methods. The oligomers were first 
annealed at a concentration of 200 microgram per 
milliliter (tig/ml) in 0.1 M NaCI. The mixture was 
heated at 65 °C for 10 minutes, 37° C for 20 min- 
utes, 15*C for 20 minutes and 4°C for 20 minutes- 
The probes were made radioactive by enzymatic 
fllllng-ln with Klenow fragment from E coli DNA 
polymerase I and appropriate radioactive 
nucleotides. The synthesis reaction consisted of 
02 ug annealed oligomers, 400 micrometer (uM) 
dTTP, 400 UM dATP, 100 microcurfe (uCf) <nP32- 
dCTF (3000 Curie per miinmole), 100 uCi o-P32- 
dGTP (3000 Curie per mHUmole). SO mnilmolar 
(mM) Trie (pH 7.2), 10 mM MgCfe. 0.1 mM dhhfoth- 
reitol, 50 ug/rnl bovine serum albumin, and 10 units 
Klenow fragment In a volume of 30 microliter (ixi). 
The reaction was incubated at 23'C for 00 minutes 
and then was made 400 uM with respect to un- 
labeled dCTP and dGTP. After 16 minutes at 23 °C 
the reaction was stopped with 1 uX of 10% sodium 
dodecyl sulfate (SOS) and the labeled oligomers 
purified by chromatography on Sephadex G-50®. 
Using this method, specific activities of 10 s cpm/ug 
were routinely reached. The two fllled-ln probes 
were pooled for initial screening of the library. 



Construction of mRNA Library 

A lambda gt11 cONA phage library of human 
mRNA was constructed as described by Huynh et 

s al., 1985. The library consisted of 10 6 Independent 
clones of which 71 % of the packaged phage con- 
tained inserts. The library was screened as de- 
scribed by Benton et al, 1 977. 

A total of 10* Independent phage were plated 

to on the host E, coJi strain, Y1Q88, at a density of * 
375 phage/cm 2 . The nitrocellulose filters used for 
plaque lifts were prehybridized for at least 4 hours 
at 37°C In 20% formamlde, 2X Denhardt's solution, 
5X SSPE (20X 38PE is 3.6M NaCI. 200 mM 

is NaH 2 P04* pH 7.4. 20 mM EDTA. pH 7.4), 0.1% 

SDS, 100 ag/ml sheared, single stranded salmon 
sperm DNA. The filters containing the lifted 
plaques were then hybridized overnight in the 
same conditions with 10* cpm/mi of the redlolabei- 

20 ed oligomer probe mixture. The probe mixture was 
incubated at 100°C for 10 minutes before adding 
to the filters. The filters were washed in 0.2X 
SSPE. 0.1% SDS at 37°C and exposed to Kodak 
XAR-2 film using Dupont Hl-Plus Intensifying 

25 screens at -70*C. 

By these methods, more than 100 positive 
clones were Identified In the 428,000 phage which 
were screened. Twelve positive clones were se- 
lected and purified 1 by the isolation of single phage 

30 plaques. These individual clones were then ana- 
lyzed by characterizing their melting temperature, 
I.e., temperature of thermal denaturation of the dou- 
ble stranded probe-DNA complex with each probe 
and determining the size of the inserted DNA. All 

35 twelve clones hybridized to each Individual probe. 
The probe for domain I remained hybridized to 
each of the clones at 60°C in 0.1X SSPE, 0.1% 
SDS. The probe for domain II washed off between 
50°C end 60*C. The size of the Inserts ranged 

40 from about 700 to about 1300 nucleotides as deter* 
mined by polyacrytamide gel electrophoresis of the 
cloned fragment after removal with EcoRl restric- 
tion endonudease from the Identified recombin- 
ants. The clone with the largest insert was chosen 

4a for further analysis by DNA sequencing. 



DNA Sequencing 

so The DNA fragment inserted in the phage vector 

was excised from the vector using the restriction 
enzyme, EcoRl. The released fragment was gel 
purified and used to generate a random library of 
smaller fragments in the M13 vector, mp-18, using 

ss the sonlcation technique of Deinlnger, 1983. In- 
dividual M-13 clones were sequenced by the 
dldeoxy chain termination method of Sanger et al, 
1077. Twenty random clones were first used to 
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determine the DNA sequence. Four ollgodeox- 
ynucleotide primers were then designed end syn- 
thesized. These primers were used to confirm re- 
maining regions of ambiguous sequence. Each 
nucleotide in the gene was confirmed by sequen- 
cing the region at least three times and 95% of the 
nucleotides were sequenced four times. 

Data from the shotgun DNA sequencing was 
aligned using the GEL program of Intelligenetics. 
An open reading frame search was done with the 
DMA-Protein Sequence Analysis software pur- 
chased from International Biotechnologies, Inc., 
New Haven, Connecticut, U.S. A, The DNA se- 
quence which was determined is presented in Fig- 
ure 2. Figure 2 reveals that the EcoRI fragment of 
m this clone is 1,232 nucleotides long. The fragment 
contains an open reading frame of 352 amino acids 
which initiates with a methionine codon at 
nucleotide position 51 and terminates with a stop 
codon starting at nucleotide 1108. The clone also 
contains an eight nucleotide stretch of adenylate 
residues adjacent to the EcoRI site at the 3' end. 
The signal for poly(A) addition, AATAAAA. is found 
16 nucleotides upstream from the poly (A) tail. The 
oligonucleotides used to construct the probes have 
high homology with the determined DNA sequence. 
The domain I probe has 94% homology with the 
cDNA and the domain II probe has 91 % homology. 
These high levels of homology are consistent with 
the melting characteristics of these probes. 

The complete amino acid sequence of Ht-30 
can be translated from the nucleotide codons be- 
tween bases 665 to 1100 at the 3* end of the open 
reading frame. The amino acid sequence of HI-30 
predicted by the DNA sequence is almost identical 
to that determined for HI-30 purified from human 
urine. There are minor differences at two locations. 
The previously published protein sequence assigns 
VaHle at amino acid positions 86-67 and Qlu at 
position 138. The DNA sequence determined here- 
in for the m light chain assigns an iie-Vai pair and 
a Qly at these positions, respectively. The DNA 
sequence is almost Identical to that determined 
previously by other workers for a cDNA done from 
human fiver. These workers report only the last 264 
nucleotides of a similar mRNA (Bourgulnon et al, 
1985). The- only difference between these two se- 
quences is an extra cytoslne in our sequence 
found at position 1183 In the 3* untranslated region. 

The amino terminus of the open reading frame 
contains a 205 amino acid sequence which has not 
been previously identified with either 180 kd m or 
Hl*30. To determine if this protein sequence might 
be related to that of other serum proteins, this 
sequence was used to search the National Biomed- 
ical Research Foundation Protein Sequence 
database. This database was searched with the 
(FIND program of Intelligenetics. The sequences 



were aligned after positive matches were identified 
using the AUGN program from the same vendor. 
The sequence showed an almost perfect match 
with the sequences of two proteins, aipha-1 -micro- 

6 globulin and protein HC. The sequence of protein 
HC is more complete, however. In relation to pro- 
tein HC, there is a mismatch at position 55 and a 
two amino add gap at positions 66 and 57. The 
sequence stops three amino acids short of the Arg- 

'ro Arg dipeptide adjacent to the first amino acid of the 
HK30 region. Both alpha-1 -microglobulin and pro- 
tein HC sequences, begin at position Gly-20 of the 
amino terminal domain of the m tight chain. 

Figure 3 summarizes the structure of the IT! 

rs light chain, it contains a signal sequence followed 
by the sequence of alpha-1 -microglobulin. At the 
junction of the alpha-1 -microglobulin and HI-30 
coding regions are two arglnine residues. The se- 
quence of HI-30 begins immediately after these 

20 residues and terminates with an additional Ser-Asn 
dipeptide. The protein has a calculated molecular 
weight of 42 kd which Is similar In size to the IT! 
light chain synthesized In baboon liver. 

26 

In vitro RNA Transcription 

To demonstrate the structure of this newly dis- 
covered protein, an mRNA transcript from the 

so cDNA clone of the m light chain gene was syn- 
thesized as follows. To prepare the specific mRNA, 
the m light chain clone was llgated into the EcoRI 
site of the Bluescrlbe* RNA expression vector 
which was purchased from Stratagene Cloning 

35 Systems, Inc. (San Diego, California, U.SJV) This 
construct was cut with Hindlil and then transcribed 
in vitro using T7 RNA polymerase according to the 
method of Krteg et al, 1884. This produced an 
mRNA copy of the m light chain DNA insert 

40 Analysis of the transcription product by gei elec- 
trophoresis showed that the mRNA was a full 
length copy of the DNA. The mRNA transcript was 
then extracted with phenol and ethanol precipitated. 
Prior to translation, infra, the RNA tr ans cript was 

45 capped with vaccinia virus guarrytyitransferase. The 
reaction mixture contained 50 uM Trie (pH 7.9), 
1.25 UM MgCfe. 6 mM KCU 2£ mM dithtothreftol, 
100 ug/ml bovine serum albumin, 100 uM 8-ad- 
enosyi methionine and 330 uM OTP. One unit of 

so enzyme, which was purchased from Bethesda Re- 
search Laboratories, was used per ug of mRNA. 
The reaction was carried out at 37* C for 45 min- 
utes, it was terminated by the addition of SDS (to 
0.5%) and then phenol extracted. 

68 
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In vitro-Translation and Protein Production 

Capped mRNAs produced in the above reac- 
tion were translated to the corresponding protein In 
the micrococcaJ nuclease-treated rabbit reticulocyte 
lysate translation system purchased from Promega 
Biotech.,* Madison. Wisconsin", U.6.A. The transla- 
tions were performed as described by Pelham et 
alp 1076. 

The protein product thus obtained was then 
challenged with antibodies to aipha-1 -microglobulin 
and to inter-alpha-trypsin inhibitor. These rabbit 
antibodies (IgG fraction) were purchased from* Ac- 
curate Chemical and Scientific Corporation 
(Westbury, New York, U.S.A.) Immuno-precipHa- 
tions were performed as described by Dobbersteln 
et a), 1979. The immuno-preclpltated products 
were characterized by electrophoresis on a 
polyacrylamide gel as described by Laemmli, 
1970. Samples were boiled for 10 minutes in 4% 
SDS and 50 mM dithlothreltoi before loading onto 
the gel. The results of these experiments are 
shown in Figure 4. 

The in vitro translation product is a 42 kd 
protein (lane Q) which is consistent with the size 
predicted by the open reading frame In the newly 
cloned gene. The protein band is slightly distorted 
by co-migrating protein in the lysate. The new 
protein product is precipitated with antibodies to 
both alpha-1 -microglobulin and 180 kd m (lanes E 
and F). No significant amount of protein product is 
precipitated by non-immune antibody, anti-beta* 
galactosidase antibody or when no antibody is ad- 
ded' (lanes B, C, and D, respectively). Thus the m 
light chain mRNA encodes for a new protein con* 
tainlng the amino acid sequences of aJpha-1 -micro- 
globulin with HI-30. Both regions of the fusion 
protein product are apparently properly folded to 
present epitopes which are recognized by specific 
antibodies made to mature 31 kd alpha-1 -micro- 
globulin and the high molecular weight form of ITt. 

The skilled artisan will appreciate that protein 
production utilizing the cDNA for the m light chain 
may also be accomplished In vivo. For example, 
the cDNA of the m light chain may be Incor- 
porated into a suitable cloning vector (l.e.. transfer 
vector) such as a plasmld or the DNA of certain 
bacteriophage. Plasmid vectors contain (or may be 
engineered to contain) various restriction sites suit- 
able for opening the circularized plasmid structure 
and the subsequent ligation of the DMA sequence 
into said vector. Bacteriophage DNA may carry an 
Inserted segment of heterologous DNA in place of 
certain nonessential phage genes* In either in- 
stance, the vector provides a means for the in- 
troduction of the heterologous sequence into- the 
host, said vector also carrying the genetic Informa- 
tion necessary to provide for serf-replication there- 



in. The host cell may be selected from among the 
various procaryotic and eucaryotic hosts now con- 
ventional in the art which are used for gene expres- 
sion. Such organisms include, for example, various 
s strains of E. coll, Bacillus subtills , Saccharomyces 
gerevisiae . and the tike. Further, the host may be 
mammalian in nature and include such cell lines as 
baby hamster kidney ceils, Chinese hamster ovary 
cells, He La and VEHO ceil lines. 

io Such in vivo protein production may be exem- 
plified as follows. The cDNA sequence encoding 
the m Gght chain is ligated into the unique EcoRl 
site of the expression vector pKK223-3 which is 
commercially available from Pharmacia, Inc. 

75 (Molecular Biologlcals Catalog, p. 63, 1864). Inser- 
tion of the cONA sequence of the IT! light chain in 
the correct orientation Is then determined by re- 
striction mapping techniques resulting in a recom- 
binant plasmid. Said plasmid is then used to trans- 

20 form §. coil JM105 cells by conventional tech- 
niques? The transformed bacteria are then cultured 
under conditions suitable for growth. The expres- 
sion of the cDNA encoding the m light chain is 
achieved by Induction with isopropyM*thiogalao» 

25 toside. Processing of the microorganism, by tech- 
niques standard in the art is followed by subse- 
quent Isolation of the resultant a- 1 -microglobulin' 
Hl-30 fusion protein. 

As noted previously, ITI is a serine protease 

30 inhibitor found in human plasma and serum. This 
protein has been reported to inhibit the proteolytic 
enzymes trypsin, chymotrypsln and neutrophil 
elastase. The most important proteinase inhibitory 
activity, from a therapeutic standpoint is the inhlbl- 

36 tion of the hydro lytic enzyme elastase which may 
be released from several sources within the human 
body, m can be used therapeutically to treat dis- 
eases which result from an excessive release of 
hydrolytlc enzymes, especially elastase from var- 

40 ious sources. For example, an excessive release of 
pancreatic elastase results in pancreatitis. Exces- 
sive serum elastase levels have been Implicated in 
atherosclerosis and it Is well known that release of 
excessive neutrophil -elastase Is observed in acute 

45 and chronic inflammation resulting In damage to ail 
connective tissue, blood vessel wails and in 
necrosis and degeneration of lung tissue. Equally 
important is the part played by lysosomal en- 
zymes, In particular neutrophil elastase, in frrflam- 

50 m story reactions due to immunological processes, 
for example, rheumatoid arthritis. 
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(b) preparing radiolabeled oligonucleotide probes 
complementary to said HI-30 coding region; (c) 
using said probes to screen said library of step (a); 
and (d) isolating from said library said cDNA se- 

5 quence encoding alpha-1 -microglobulin in . se- 
quence with the HI-30 coding region of the inter- 
aJpha-trypsin Inhibitor light chain. 

32. The method of the claim 1 wherein the 
cDNA sequence encoding alpha-1 -microglobulin in 

70 sequence with the HI-30 coding region of the Inter- 
aipha-trypsin Inhibitor light chain additionally con- 
tains a DNA sequence coding for a secretory lead- 
er sequence 5' to said cONA sequence. 

15 



so 



25 



30 



Claims 



1. A cDNA sequence encoding aJ p ha- 1 -micro- as 
globulin in sequence with the HI-30 coding region 

of the Inter-alpha-trypsin inhibitor light chain. 

2. The cDNA sequence of Claim 1 additionally 
comprising a DNA sequence coding for a secretory 
leader sequence 5' to said cONA sequence. 40 

3. A recombinant DNA cloning vector compris- 
ing the cDNA sequence of Claim 1. 

4. A recombinant DNA cloning vector compris- 
ing the cDNA sequence of Claim 2, 

5. A microorganism transformed by the cloning 46 
vector of Claim 3. 

6. A microorganism transformed by the cloning 
vector of Claim 4. 

7. A fusion protein of the fnter-alphe-trypsln 
light chain comprising 0-1 -microglobulin and HI-30. so 

Claims for the following Contracting State : AT 

1. A metrfod for isolating a cONA sequence 
encoding alphe-1-rnlcroglobulln in sequence with 66 
the HI-30 coding region of the inter-alpha-trypsin 
Inhibitor fight chain comprising the steps ot (a) 
preparing a cDNA phage library of human mRNA; 
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FIG. 1 



DOMAIN X 

Mot Ala Cyo Glu Tbr Phe 
-ATG GCC Tac OAG ACC TTC 

DOMAIN ZZ 

Cyo Gin Gly Asn Gly Aon 
-TGC GAG GGC AAC GCC AAC 



Gin Tyr Gly Gly cyo Hot 

OAG TAC OGC QGC TGC-3 » 
3»-ATG OCG CCG ACG TAC 

Lyo ptxm Tyr Sor Glu Lyo 

AAG TTC TAC ACC CAO-3 * 
3 ' -AAG .ATG TCG GTC TTC 



Gly Asn Gly Aon Asn Pho 
CCG TTO CCG TTG TTO AAG— B ' 

Clu Cyo Arc Gin Tyr Cyo 

CTC ACQ TCT CTC ATG ACG-5 ' 
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FIG. 2 



10 20 30 40 50 60 

' • • • m m 

AA TIC TAG ACC GAG CCT GTG GGA TAX ACC AAG GCA GAG GAG CCC AiA GCC ATO AOS ACC CTC 

Met Arg Ser Leu 

3° - 2° 2° 100 a-1-mic 120 

* ■ ■ I Ml iTfr * 

GOG GOC CTC CTC TTG CTQ CTO ACC GCC TGC CTG OCO GTG AGC OCT GCC CCT <HXJ CCA ACQ 
Gly Ala Leu Leu Lea Leu Ln Ser Aim Cye Leu Ale. VfcQ Ser Ala! Gly Pro Ve4 Pro Thr 



1^0 140 150 160 170 

• * • w * 

2? **** CAO ^AT AIC TCT OGO ATC TAX GGQ AAC TOG 

PraProAspAamlleGlnVSilGla OOu Aan Pha Aen lie Ser Arg tie Tyr Gly Lye Trp 

iso 2oo aio aao a3o 240 

• ■ * » ■ « 

JAC AAC CTQ GCC ATC GOT TGC ACC TGC CCC TOO CIO AAG AAC ATC ATG GAC AGO ATC ACA 
Tyrtot»AO*llAQlySer»rCy»PrtiTrpi«iA«I^ll« 

260 360 37Q 



Vkl Ser Thr Vol Leu Oly Gla Gly Ale Ha* Glu Al* Olu Xle Ser Met Thr Sex Thr 
310 320 330 040 



Arg Trp Arg Lye Oly vei Cys Glu Clu Thr Ser Oly AOs Tyr Olu L^ Ihr Asp Thr Asp 

370 380 30O 400 410 420 

* * m m 

Oly Lye Pbe teu Tyr Elo Lye Ser lye Trp Asn lis Thr Met Olu Ser Tyr VfcQ Vai Hie 

430 *«> 480 460 470 460 

• • • « « 

ACC AAC TAX GAT GAG TAT GCC ATT TTC CIO ACC AAO AAA TIC AGC CGC CAT CAT GBA CCC 
Uuf Asa Tyr Asp Glu Tyr Ala lie Phe Leu Thr Lya Lye Pim Ser Arg His Hie Oly Pro 

490 600 610 520 630 640 

■ ■ • • • > 

JE ^S?? 8 ^ m ™ « « ow ew » jw cxc» 

Thr tie Thr Ale Lye Leu Tyr Oly Arg AO*. Pro Gin Leu- Arg Olu Thr Leu Leu Gin Asp 

WO 660 070 680 600 600 
« 

TTC AGA GTQ OTT GCC CAG GGT GTS GOC AIC CCT GAG GAC TCC ATC TTC ACC ATO GCT fine 

«»ArgVeaveiAO«olnQlyVelCUyIloProGluA»p8«rllePte 

010 620 630 040 660 660 

" m •mm « 

COA GCT GAA TOT GTC CCT CGG GAG CAG GAA CCA GAG CCC ATC TTA ATC COO AGA GTC CC3 

Arg Oly Glu eye VW. Pro Oly Olu Gin olu Pro Olu Pro lie Leu lie Pro Arg Vfcl Arg 
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FIG. 2 (continued) 



HI -30 



6flO 



690 



AOQICCT GTQ CTA CCC CAA OAA GAG GAA GGA TCA 
ArgjAla Vel Leu Pro Gin Olu Glu Glu Gly Ser 



I 



740 



760 



700 

GGG COT QGQ CAA 
Oly Gly aiy Gin 

760 



710 



720 



CIC GTA ACT GAA GTC 
Lea Vhl TUT Glu Vol 



770 



780 



AOC AAG j AAA GAA GAT ICO TGC CAG CTO GGC TAC 

ttr Lye Ly» Glu Asp Ser Cye Gin Leu Oly Tyr Ser Ale aiy Pro 



790 



600 



810 



AOC ACQ TAT TIC TAT AAT GOT ACA TOO AT9 GCC 

ser Arg Tyr Pfce Tyr An Gly Ttsr Ser tet Ala 



TOT GAG ACT TTC 

Cye Olu Tar Pbe 



TGC ATG GGA ATS ACC 
Cye Met Gly Met ttr 



840 



CAS TAC GSC GGC TGC 
Qjn Tyr Gly Gly eye 



ATG GGC AAC 

Mat Gly Aan 



860 

GOT AAC AAC TTC 
Oly A<n Aan ?fce 



910 

GCQ GCC TGC 
Alt Al» Cya 

970 

TTT GAT OCT 
E&e Asp Ala 

1030 
m 

AAC AAG TTC 
An Lye °hm 



920 



870 

GTC ACA GAA AAG 
Val Thr Qlu Lye 

930 



AAT CTC CCC ATA 
An Lw Pro I lie 



G7CICGG GOD COO 

veil Arc aiy Pro 



GXO AAG GQG AAS 

Val Lys Gly Lys 

1040 
■ 

TAC TCA GAG AAG 
Tyr Ser Glu Lye 



990 

TGC GTC CTC TTC 
eye Val Leu Pfce 

1050 
* 

GAG TGC AGA GAG 
Qlu eye Arg Glu 



880 
* 

(SAG TOT CTO CAG 
Glu Cya Lsu Gin 

940 

TGC GGA GCC TTC 
eye Arg Ale Ffce 

1000 

CCC TAC GOG GGC 

Pro Tyr Gly Gly 

108O 
TAC TGC GOT GTC 

TyrCye Gly Val 




ATC CAG CTC TGG GCA 
lie Gin Leu Trp Ale 

1010 1020 
■ • 

TGC CAG GGC AAC GGG 

Cye Gin Gly An Gly 



1070 

OCT GST GAT 
Pro Gly Aep 



1080 



GOT 

Oly 



GAT 
Aep 



1100 



1110 



1120 



GAG GAG CTO 
Glu GlU Leu 

1150 
m 

GTG TOT GTC 



CTO COG TTC TOO AAC TGA CAA CTO GCQ GOT CTG CAA 
Leu Arg Ptm Ser 



1160 



1170 
* 



11SO 
CAA GCA ACC TOO 



1130 1140 
• • • 

QIC AGA GGA TOO CCA 

1190 1200 
CTC CAA ATA AAA ACT 



1210 1220 1230 

« * ■ 

AAA TTG TAA ACT OCT GAA AAA AAA GGA ATT 
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